CDS
Accession Number | TCMCG075C13443 |
gbkey | CDS |
Protein Id | XP_017974870.1 |
Location | join(24047296..24047317,24047399..24047517,24048784..24048846,24048998..24049043,24050079..24050297,24050555..24050667,24052748..24052916,24053063..24054152,24055211..24055413,24055563..24055800,24057931..24058267,24059413..24059622,24059799..24059891,24062035..24062241,24062422..24062628,24065852..24066338,24068186..24068394,24068537..24068653,24070304..24070360,24070445..24071575,24074235..24074597) |
Gene | LOC18602386 |
GeneID | 18602386 |
Organism | Theobroma cacao |
Protein
Length | 1899aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018119381.1 |
Definition | PREDICTED: transcription initiation factor TFIID subunit 1 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGGTCACTCCAGCTTCCAAGATGAAGATGATGATGAAGAAGAATACGAGGAGGAAGCTAGTGGGGGTAATCGGCTTCTGGGTTTTATGTTTGGGAATGTCGATAATTCTGGTGATCTTGATGCTGATTATCTTGATGAGGATGCAAAGGAGCATCTTGCTGCGGTAGCAGACAAGCTGGGTCCATCTCTTACGGATATAGATCTGTCAGAAAAATCACCGCACACTCCTGCTGATGCTGCTGACCAAGATTATGATGAGAAGGCTGAAAATGCCATTGACTATGAAGATTTTGACGAGGAGTATGATGGTCCAGAGATTGAAGCTGCTACTGAGGAGGACCGTTTACTTCCCAAAAAGGAATATTTTACTGCTGATGTTTCTTCTGCTTTGGAGCCTAAATCCTCTGTATTTGATGATGAAAACTATGATGAGGATGAGGAAAGTGAAAAGGAACAAGAGGTGGTAGGTGAGCAGGAAGAGCGTACGGTACCATTTGAAGTAGAAAAATCTCCTGAGGATGATGTACAGTGTGGCTCTTCAGATAGTGAAAAACAAGCAGATGGTACTGAAGATTTTCAGGAAATAGCTGGTGTTCTGGAGGAGCCATTAGATGGTCAAGGTTCCACTCCTCTGCCTGTTTTATGTGTTGAAGATGGTATGGTAATCTTACGGTTTTCAGAAATATTTGGTATTCATGAACCCTTGAAGAAAGCAGATAAAAGAGAGCATGGCTATTTCACTCATAGAGAGAAATACAAATCTATGGATGCATCTGATCTTGTAGAAGAGGATGAAGAGGTATTTCTAAAGGGCACTGGTCAAGGGTTTTCATTCATTGGATGGGAAAATGCCATCCAACAAGATAGCCCTGAATTTACTGATGAACCTCTTGTGCAAGGAGGCTTGGCAATGTCTGCACACAATGAAGAACACATTAAGGATTCTTATTCTAGTCCAGAACCAATGAAAGAGGACATAGTAGTAAATATTTCTACTGGATGGCAGTCACCTTCATGTCCAAGATTTTTTGCCCTTGACCAGCTGGACTGGGAAGAACAAATTCTTTGGGACAATTCTCCTGCAATAAGTGGCGACTCTTTGGAGAGTCCTGAAATCTCTGTCTCTGATTTGGAAGCTTCAGTTGCAAGAGAGACCATTCCACAGACGGGGCAAAATATACTGTCTGAGCATTCTACAAAACCCTATGAGAAGGACCATGACAGCAGCCTTTGTAGTTCCTCTGTTTTTTTGGAACCGTTTGGCTCAAGAAATTCTTCAGGATCTATGGACCTTTCCTTCATGGACAGATTCCATCCCCAACTCTTAAGGTTGGAATCTCCGTTGGGAGTAGATTCTTCAAATGATGGTGATCATAAAAGAGAATATGTTACCATAGATACAGATAAAAGTGATGTTGTGAGGTGTTTTAACCAACTTACGTTGCAAAATAGAGATATGATGGAAGGGTCATGGTTAGACAATATTATATGGGAACCACATAGTGTGATTGCAAAGCCTAAGTTAATTCTTGATCTTCAAGATAAGCAGATGCTTTTTGAAATTTTTGATAACAAGGAAAGTAAACATCTTCAGCTCCATGCAGGGGCTATGATCATTACTCGACCTGTAAAGCCAAGCAGTCTTGGTTCTTCCGAGGTGTCAGGTCATAAATATCAACCTGGTTGGCAATTCAATATTGCTAATGACAAGTTCTATGTGAACCGTAAAGTTTCTCAACAGTTACAATCAAATTCTAATAAACGCATGGCTCATGGTGTCAGAGTTCATCATTCAGCACCTGCACTTAAGCTTCAAACAATGAAGTTGAAGTTGAGCAATAAAGATATAGCAAATTTTCACCGTCCAAGAGCGATATGGTATCCACATGATATTGAGGTGGCTGTCAGGCAACAAGGGAGGCTGCCAACACAAGGACCCATGAAAATTATATTGAAGAGCTTGGGAGGCAAAGGAAGCAAACTACATGTTGAT
GCTGAGGAAACTGTCTCTTCTGTTAAAGCAAAGGCTTCTAAGAAGCTAGATTTCAAGCCATTAGAAACAGTAAAAATGTTTTATCTTGGGAAGGACCTTGAAGATGATAAGTCTCTTGCTACCCAAAATGTTCAACCAAATTCTCTGCTTCATCTTATTCGCACAAGAATACATCTATTGCCAAGGGCACAAAAACTTCAAAGAGAAAATAAGTCTTTACGGCCTCCCGGGGCATTTAAGAAGAAATCTGATCTTTCTGTGAGAGATGGTCACATTTTTCTAATGGAGTATTGTGAGGAGAGGCCTTTACTTTTGAGCAATCCAGGGATGGGTGCAAACCTGTGCACTTACTATCGAAAAGCATCTTCAGGTGATCAAACTGGTGGATTGTTACGCAATGGAAATCAAACGTTGGGAAATGTTCTGCTTCTAGAGCCAGCTGATAAATCACCTTTCCTTGGAGATATTAAAGCCGGCTGTAGCCAGTCATCTCTTGAAACAAACATGTACAAGGCTCCCATATTTTCTCACAAGGTGCCATCAACAGACTTCCTATTGGTTAGATCAGCGAAGGGAAAGCTTTCCATACGACGTATTGATAAGATTGCTGTTGTCGGACAACAGGAGCCTCTCATGGAGGTAATTTCTCCTGGATTGAAGAATCTTCAGACATATCTCATGAATAGGATGTTGGTTTATGTGTATCGGGAGTTTTCTGCAGCTGCCAAGCGTGGTTTGACTCCATTCATTGGTACAGATGAGTTGTTTACTCATTTTCCTAACCTTTCAGATGCAATTGTCAGGAAAAAGCTGAAGGAATGTGCTTATTTGCGGAGGGATAAAAGTGGACGGCAGATTTGGTCCATGAAGCCTGATTTCCATATTCCACCCGAAGGTGTTCTGAGAAAGTTGGTCTTTCCGGAGCATGTGTGTGCCTATGAAAGCATGCAAGCTGGTCTGTACCGGCTCAAACATTTAGGAATCACACGACTAACACATCCTACCACTATTTCATCTGCAATGAGTCAACTCCCTGATGAAGCTATTGCTCTGGCTGCTGCATCACACATTGAAAGAGAGCTTCAGATAACTCCATGGAGTCTCAGTAGCAATTTTGTTGCATGTACAAGCCAGGACAGAGAATGTATTGAGCGCCTGGAAATAACTGGTGTCGGTGATCCTTCTGGTCGAGGCTTAGGATTTAGCTACGTTCGTACTGCTCCTAAGGCTCCAATGTCAAATGCTATGATGAAGAAGAAAACAGCTGCTGGTCGTGGAGGTTCTAGTGTAACTGGGACAGATGCTGACCTTCGTAGATTAAGCATGGAGGCAGCACGAGAGGTTCTTTTGAAGTTCAATGTTCCTGATGAAGTGATTGCAAAACAAACTAGGTGGCATCGTATTGCTATGATACGCAAGCTCTCAAGTGAGCAAGCTGCATCAGGAGTCAAGGTTGATCCGACAACTATCAGCAAATATGCACGGGGCCAACGAATGTCCTTTCTCCAGTTGCAACAGCAGACCAGAGAAAAATGTCAGGAAATATGGGATCGACAAGTTCAGTCTCTTTCAGCTGTTGATGGTGAAGAAAATGAAAGTGACTCTGAGGCAAATAGTGATCTTGATTCCTTTGCTGGTGACCTTGAAAATTTGCTTGACGCAGAGGAATTTGAAGAAGGGGAAGAGGTTAACAATGATAACAAGATTGACAAAGTGGACGGTGTTAAGGGGCTCAAGATGAGAAGGCGCCCCTCTAAAGCTCAGGCAGAAGAGGAAATTGAAGATGAAGCTGCTGAAGCTGCTGAATTATGTCGTTTACTCATGGATGATGATGATGAGCAAAAGAAGAAAAAGAAGAAGAAAAACAAAGCTGTTGCTGGAGATGTTGGGTTGAGTTTTGGCTTACAATCACGTATCAGCAGTGAGAATGTACAACGTGTTAAGAAAGCAAGCACGATTTCCAAGCAAATCGTTGGTGCTACTCAACCTAATGGGTCATA
TACTACGAACGAAAACATTGTTAAGGATCCAAAGGATATTGAAAGTAGAATGTTTAAAGGAAACTTGTCTGGAAAGGTGAAAGGAATGAAAAAGAATGGTATGTCAAGCACAGGTCCACTGACAAAAGTCAAAATATTGGGAGACAATGTTAAGCTGTTCAAGGAGAAGAAATCATCAAGAGAGACGTTTGTCTGTGGAGCTTGTGGTCAGCTTGGACACATGAGGACCAACAAGAACTGCCCCAAGTATGGAGAAGATCCAGAGTTACAGGTTGATGCTGCTGATTTCGAAAAACCTTCAGGAAAATCAACTTCGCTGGAACCCTCTGGTCTGTCCCAGTTGAAAACTATGAAAAAGAAGCTCATACCTAAAAGTGCGACAAAGATAGCTGTTGTAGAAGCTTCTGAGGGTGAAAAATCTAGTTCAAATGCAAAAGCTTTCCCATTGAAGTTCAAATGCTCTACTGATAGGCCTTCTGATAAACTTGCTTCTGGAGCCACACAAAGTTCTGACTATCAAGTTACTTCTGATCCAGAAAGTGGAATCAAGTCTGTTGCGAAGGTTAGTAAAATAATTATTTCCAACAGAGCAAAGCCTGATGAAATGCAGGTTGAGTCTCAGAAGCTGCCTTTCGTCATACGTCATCAAATGGATAATGATAGAGGGCAAGCTGAATCTCATAAGAGATCTATTGTTATACGGCCACCAACAAATATGGAGAGGGACCAAGTTGAACCTCATAAACCCTCTGTTGTCATACGCCCACCTGCAGATAAAGATAGAGAGCAACCTCACAAGAAAATTATAATCAAGCGACCAAAGGAGATTATTGATCTGGATCAGATTTGTCAGGAGGGAGGCACTTATCCTGAGTACAGGAAAACAAAAAAAATAGTTGAATTGTCAAGCTTTGAGAAGCATGGGAAGCAGGAAAGCTTGCGATTAACAGAGCAAACAGCAAGGCGCAAAGCCAAAGAGGAGAAAGGATGGTGGGAAGAGGAACAGAAGAGGAGAAATATGGAGATGCTAAGAGAAGAAAGGGCAAGGAAGCTTTATGAGGAAAAACTGAGAGCACTGGAAGAGCGAGAAAGATTTGCTGAGATCACAAGATACACAGAAGACATTCGGAGAGAGAGGGAGGAAGAAGAACGCCAGAAAGCGAAGAAGAAAAAGAAGAAGAAAAAAGCTGACATAAAAGACGATTATTTAGAAGACTACAGGACAAGAAGAAATGACAGAAGAACGCTGGAAAGAGACCGAGGTGCAAAAAGGAAACCGGTTGAGTTAGGACGATATGGTGCAGAGTATGTCCCTCCAACAAAGCGACGCAGAGGGGGAGAGGTTGGTTTGTCCAATGTCTTAGAGCGGATTGCAGAAACACTTAGGGATAATACCGAGTTGTCCTATCTGTTTCTAAAGCCAGTGTCCAAGAAGGAGGCTCCTGACTACCTGGATATCATAAAGCGCCCAATGGATTTGTCCACCATAAGGGACAAGGTTAGGAGGATGGAATACAAAGATCGAGAGGAATTTAGACATGATGTGTGGCAGATTGCCTATAATGCTCACATCTACAATGATGGTCGCAATCCAGGCATTCCTCCTCTTGCTGATCAGCTTTTGGAGCTTTGTGACTACTTGATGGATGAGTATCATCAGAGCTTGTGTGAAGCTGAAGCTGGAATAGAAGCTAGGGATACTTAA |
Protein: MGHSSFQDEDDDEEEYEEEASGGNRLLGFMFGNVDNSGDLDADYLDEDAKEHLAAVADKLGPSLTDIDLSEKSPHTPADAADQDYDEKAENAIDYEDFDEEYDGPEIEAATEEDRLLPKKEYFTADVSSALEPKSSVFDDENYDEDEESEKEQEVVGEQEERTVPFEVEKSPEDDVQCGSSDSEKQADGTEDFQEIAGVLEEPLDGQGSTPLPVLCVEDGMVILRFSEIFGIHEPLKKADKREHGYFTHREKYKSMDASDLVEEDEEVFLKGTGQGFSFIGWENAIQQDSPEFTDEPLVQGGLAMSAHNEEHIKDSYSSPEPMKEDIVVNISTGWQSPSCPRFFALDQLDWEEQILWDNSPAISGDSLESPEISVSDLEASVARETIPQTGQNILSEHSTKPYEKDHDSSLCSSSVFLEPFGSRNSSGSMDLSFMDRFHPQLLRLESPLGVDSSNDGDHKREYVTIDTDKSDVVRCFNQLTLQNRDMMEGSWLDNIIWEPHSVIAKPKLILDLQDKQMLFEIFDNKESKHLQLHAGAMIITRPVKPSSLGSSEVSGHKYQPGWQFNIANDKFYVNRKVSQQLQSNSNKRMAHGVRVHHSAPALKLQTMKLKLSNKDIANFHRPRAIWYPHDIEVAVRQQGRLPTQGPMKIILKSLGGKGSKLHVDAEETVSSVKAKASKKLDFKPLETVKMFYLGKDLEDDKSLATQNVQPNSLLHLIRTRIHLLPRAQKLQRENKSLRPPGAFKKKSDLSVRDGHIFLMEYCEERPLLLSNPGMGANLCTYYRKASSGDQTGGLLRNGNQTLGNVLLLEPADKSPFLGDIKAGCSQSSLETNMYKAPIFSHKVPSTDFLLVRSAKGKLSIRRIDKIAVVGQQEPLMEVISPGLKNLQTYLMNRMLVYVYREFSAAAKRGLTPFIGTDELFTHFPNLSDAIVRKKLKECAYLRRDKSGRQIWSMKPDFHIPPEGVLRKLVFPEHVCAYESMQAGLYRLKHLGITRLTHPTTISSAMSQLPDEAIALAAASHIERELQITPWSLSSNFVACTSQDRECIERLEITGVGDPSGRGLGFSYVRTAPKAPMSNAMMKKKTAAGRGGSSVTGTDADLRRLSMEAAREVLLKFNVPDEVIAKQTRWHRIAMIRKLSSEQAASGVKVDPTTISKYARGQRMSFLQLQQQTREKCQEIWDRQVQSLSAVDGEENESDSEANSDLDSFAGDLENLLDAEEFEEGEEVNNDNKIDKVDGVKGLKMRRRPSKAQAEEEIEDEAAEAAELCRLLMDDDDEQKKKKKKKNKAVAGDVGLSFGLQSRISSENVQRVKKASTISKQIVGATQPNGSYTTNENIVKDPKDIESRMFKGNLSGKVKGMKKNGMSSTGPLTKVKILGDNVKLFKEKKSSRETFVCGACGQLGHMRTNKNCPKYGEDPELQVDAADFEKPSGKSTSLEPSGLSQLKTMKKKLIPKSATKIAVVEASEGEKSSSNAKAFPLKFKCSTDRPSDKLASGATQSSDYQVTSDPESGIKSVAKVSKIIISNRAKPDEMQVESQKLPFVIRHQMDNDRGQAESHKRSIVIRPPTNMERDQVEPHKPSVVIRPPADKDREQPHKKIIIKRPKEIIDLDQICQEGGTYPEYRKTKKIVELSSFEKHGKQESLRLTEQTARRKAKEEKGWWEEEQKRRNMEMLREERARKLYEEKLRALEERERFAEITRYTEDIRREREEEERQKAKKKKKKKKADIKDDYLEDYRTRRNDRRTLERDRGAKRKPVELGRYGAEYVPPTKRRRGGEVGLSNVLERIAETLRDNTELSYLFLKPVSKKEAPDYLDIIKRPMDLSTIRDKVRRMEYKDREEFRHDVWQIAYNAHIYNDGRNPGIPPLADQLLELCDYLMDEYHQSLCEAEAGIEARDT |
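As a consistency check on this record, the exon spans in the Location field can be summed and compared with the stated protein length. The sketch below is illustrative only: the helper names `parse_join` and `cds_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS includes the stop codon (the CDS sequence above ends in TAA).

```python
import re

# Location string copied from the CDS record above.
LOCATION = (
    "join(24047296..24047317,24047399..24047517,24048784..24048846,"
    "24048998..24049043,24050079..24050297,24050555..24050667,"
    "24052748..24052916,24053063..24054152,24055211..24055413,"
    "24055563..24055800,24057931..24058267,24059413..24059622,"
    "24059799..24059891,24062035..24062241,24062422..24062628,"
    "24065852..24066338,24068186..24068394,24068537..24068653,"
    "24070304..24070360,24070445..24071575,24074235..24074597)"
)


def parse_join(location):
    """Return (start, end) integer pairs for every 'start..end' span."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(segments):
    """Sum span lengths, treating coordinates as 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total_nt = cds_length(segments)      # nucleotides across all exon spans
codons = total_nt // 3               # includes the terminal stop codon
protein_aa = codons - 1              # residues, excluding the stop

print(len(segments), total_nt, protein_aa)
```

Under these assumptions the 21 spans total 5700 nt, that is 1900 codons, which matches the 1899 aa protein plus one stop codon.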